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1 Focusing processor policies via critical-path prediction 
Brian Fields, Shai Rubin, Rastislav Bodik 

May 2001 ACM SIGARCH Computer Architecture News , Proceeding 
annual international symposium on Computer architecture '. 
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Publisher: ACM Press 

Full text available: ^pdffl.10 Additional Information: full citation , abst 
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Although some instructions hurt performance more than others, current j 
apply scheduling and speculation as if each instruction was equally costl 
can be naturally expressed through the critical path : if we could predict i 
egalitarian policies could be replaced with cost-sensitive strategies that v 
increasingly effective as processors become more parallel. 

This paper introduces a hardware predictor of instruction critic ... 
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This paper describes a method for compiling programs using interproced 
allocation. A strategy for handling programs built from multiple module: 
well as algorithms for global variable promotion and register spill code r 
algorithms attempt to address some of the shortcomings of previous intei 
allocation strategies. Results are given for an implementation on a single 
based architec ... 

4 Broadcast protocols to support efficient retrieval from databases by mobile 
^ Anindya Datta, Debra E. VanderMeer, Aslihan Celik, Vijay Kumar 

March 1999 ACM Transactions on Database Systems (TODS), Volume 
Publisher: ACM Press 

Full text available: ^ pdf(638.48 Additional Information: full citation , abst 
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Mobile computing has the potential for managing information globally. 1 
issues in mobile computing have received some attention in recent times 
adaptive braodcast protocols has been posed as an important probllem. S 
employed by database servers to decide on the content of bbroadcasts dy 
response to client mobility and demand patterns. In this paper we design 
also propose efficient retrieval s ... 

Keywords: adaptive broadcast protocols, client-server computing, energ 
mobile databases 
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Several cache-coherent shared-memory multiprocessors have been devel 
scalable and offer a very tight coupling between the processing resource: 
quite attractive for use as compute servers for multiprogramming and pa: 
workloads. Process scheduling and memory management, however, rem; 
to the distributed main memory found on such machines. This paper exa: 
OS scheduling and page migration policies on the perfo ... 
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May 1999 ACM SIGARCH Computer Architecture News , Proceeding 
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KB) W Additional Information: full citation , abst 
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Site 

By optimizing data layout at run-time, we can potentially enhance the pe 
by actively creating spatial locality, facilitating prefetching, and avoiding 
and false sharing. Unfortunately, it is extremely difficult to guarantee ths 
optimizations are safe in practice on today's machines, since accurately i 
to an object requires perfect alias information, which is well beyond the 
compiler for languages such as C. T ... 
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Fernando Pereira, Naftali Tishby, Lillian Lee 

June 1993 Proceedings of the 31st annual meeting on Association for C 
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We describe and evaluate experimentally a method for clustering words 
distribution in particular syntactic contexts. Words are represented by th< 
distributions of contexts in which they appear, and relative entropy betw 
distributions is used as the similarity measure for clustering. Clusters are 
average context distributions derived from the given words according to 
of cluster membership. In many cases, the cluster ... 

8 A performance evaluation of cluster architectures 
<^ Xiaohan Qin, Jean-Loup Baer 

June 1997 ACM SIGMETRICS Performance Evaluation Review , Pro< 
1997 ACM SIGMETRICS international conference on Meas 
modeling of computer systems SIGMETRICS '97, Volume 1 

Publisher: ACM Press 

Full text available: ^pdf(1.65 Additional Information: full citation , abst 
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This paper investigates the performance of shared-memory cluster-based 
each cluster is a shared-bus multiprocessor augmented with a protocol pi 
cache coherence across clusters. For a given number of processors, sixte- 
evaluate the performance of various cluster configurations. We also cons 
adding a remote shared cache in each cluster. We use Mean Value Anal} 
cache miss latencies of various types and th ... 

9 Slack: maximizing performance under technological constraints 
^ Brian Fields, Rastislav Bodik, Mark D. Hill 

May 2002 ACM SIGARCH Computer Architecture News , Proceeding 
annual international symposium on Computer architecture ] 
Proceedings of the 29th annual international symposium on 
architecture ISCA f 02, Volume 30 Issue 2 

Publisher: IEEE Computer Society, ACM Press 

Full text available: 1pdf( 1.34 
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Many emerging processor microarchitectures seek to manage technologi 
wire delay, power, and circuit complexity) by resorting to non-uniform d 
resources at multiple quality levels (e.g., fast/slow, bypass paths, multi-s 
units, and grid architectures). In such designs, the constraint problem be< 
problem, and the challenge becomes designing a control policy that mitij 
performance penalty of the non-uniformit ... 

Keywords: critical path, slack, performance analysis, program behavior, 
technological constraints, wire delay, power, circuit complexity, clusters 



10 Analysis of benchmark characteristics and benchmark performance predict 
^ Rafael H. Saavedra, Alan J. Smith 

November 1996 ACM Transactions on Computer Systems (TOCS), Vc 

Publisher: ACM Press 

Full text available: * ^pdf(1.02 Additional Information: full citation , abst 
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Standard benchmarking provides to run-times for given programs on giv 
fails to provide insight as to why those results were obtained (either in te 
program characteristics) and fails to provide run-times for that program ( 
machine, or some other programs on that machine. We have developed a 
imdependent model of program execution to characterize both machine i 
program execution. By merging these machine and program characteriz 

Keywords: abstract machine performance model, benchmark analysis, e 
prediction, microbenchmarking 
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In a ten-session experiment, six participants practiced typing with an exp 
method on an optimized virtual keyboard. Based on a large amount of in 
data, this paper reports the following findings. First, the Fitts-digraph mc 
model of virtual keyboards is revised. The format and parameters of Fitt: 
previously in virtual keyboards research were incorrect. Second, perfonr 
predictions of various layouts are calculated with the new m ... 

Keywords: Fitts' law, expanding rehearsal, graphical keyboard, learning 
computing, on screen keyboard, skill acquisition, soft keyboard, text enti 
keyboard 



12 Trading quality for compile time: ultra-fast placement for FPGAs 
^ Yaska Sankar, Jonathan Rose 

February 1999 Proceedings of the 1999 ACM/SIGDA seventh internath 
Field programmable gate arrays 
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Full text available: Hpdf(1.21 Additional Information: full citation , refe: 
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13 Efficient placement algorithms optimizing delay for high-speed ECL mastt 
Yasushi Ogawa, Tatsuki Ishii, Yoichi Shiraishi, Hidekazu Terai, Tokinori ] 
Yuyama, Kyoji Chiba 

July 1986 Proceedings of the 23rd ACM/IEEE conference on Design ai 
Publisher: IEEE Press 

Full text available: ^ pdf(6 10.76 Additional Information: full citation , abst 

KB) citings , index ten 

Placement algorithms optimizing signal delay as well as wirability for hi 
masterslice LSI's are proposed. Equivalent constraints of wire length for 
path delay, and wired-OR are classified according to upper and lower lin 
such limits, a top-down method utilizing an augmented two-dimensional 
placement with "scope" and "zone", which are new concepts representin 
iterative weighted improvement method ... 
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14 Research sessions: query processing II: Efficient k-NN search on vertically 
^ Arjen P. de Vries, Nikos Mamoulis, Niels Nes, Martin Kersten 

June 2002 Proceedings of the 2002 ACM SIGMOD international confei 

Management of data SIGMOD '02 
Publisher: ACM Press 

Full text available: * ^pdf(L26 Additional Information: full citation , abst 

MB) citings , index ten 

Applications like multimedia retrieval require efficient support for sirnih 
data collections. Yet, nearest neighbor search is a difficult problem in hij 
spaces, rendering efficient applications hard to realize: index structures c 
increasing dimensionality, while sequential search is not an attractive sol 
repositories with millions of objects. This paper approaches the problem 
angle. A solution is sought in an unconvent ... 

15 High-cost CFD on a low-cost cluster 

Thomas Hauser, Timothy I. Mattox, Raymond P. LeBeau, Henry G. Dietz, 

November 2000 Proceedings of the 2000 ACM/IEEE conference on Sut 
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Publisher: IEEE Computer Society 

Full text available: IpdfROO 

MB) H Additional Information: full citation , abst 
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Direct numerical simulation of the Navier-Stokes equations (DNS) is an 
for the future of computational fluid dynamics (CFD) in engineering app 
DNS requires massive computing resources. This paper presents a new a 
implementing high-cost DNS CFD using low-cost cluster hardware. Aft* 
DNS CFD code DNSTool, the paper focuses on the techniques and tools 
developed to customize the performance of a cluster ... 

16 On balancing the load in a clustered web farm 
<g> Joel L. Wolf, Philip S. Yu 

November 2001 ACM Transactions on Internet Technology (TOIT), V 
Publisher: ACM Press 

Full text available: H pdf(6 12.40 Additional Information: full citation , abst 



http://portal.acm.org/results.cfm?CFID=1303502&CFTOKEN=742... 7/17/06 



Results (page 1): +clustering +optimiz* average normalized Page 8 of 10 

KB) citings , index ten 

In this article we propose a novel, yet practical, scheme which attempts t 
the load on the servers of a clustered Web farm. The goal in solving this 
problem is to achieve minimal average response time for customer reque 
ultimately achieve maximal customer throughput. The article decouples 
into two related but distinct mathematical subproblems, one static and or 
believe this natural decoupling is one of the major contrib ... 

Keywords: Clustered Web farms, combinatorial optimization, load balai 
allocation problems 



17 System-level power optimization: techniques and tools 
^ Luca Benini, Giovanni de Micheli 

April 2000 ACM Transactions on Design Automation of Electronic Sys 

Volume 5 Issue 2 
Publisher: ACM Press 

Full text available: " B pdf(385.22 Additional Information: full citation , abst 
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This tutorial surveys design methods for energy-efficient system-level d< 
electronic sytems consisting of a hardware platform and software layers, 
three major constituents of hardware that consume energy, namely comp 
communication, and storage units, and we review methods of reducing tl 
consumption. We also study models for analyzing the energy cost of sofi 
for energy-efficient software design and compilation. This survery ... 

18 Reducing memory latency via non-blocking and prefetching caches 
^ Tien-Fu Chen, Jean-Loup Baer 

September 1992 ACM SIGPLAN Notices , Proceedings of the fifth intei 
conference on Architectural support for programming 
operating systems ASPLOS-V, Volume 27 Issue 9 
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19 Concepts and effectiveness of the cover-coefficient-based clustering methc 
^ databases 

Fazli Can, Esen A. Ozkarahan 

December 1990 ACM Transactions on Database Systems (TODS), Vol 
Publisher: ACM Press 

Full text available: " B pdf(2.74 Additional Information: full citation , abst 
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A new algorithm for document clustering is introduced. The base concer, 
the cover coefficient (CC) concept, provides a means of estimating the n 
within a document database and related indexing and clustering analytic; 
concept is used also to identify the cluster seeds and to form clusters witi 
shown that the complexity of the clustering process is very low. The retr 
show that the information-retrieval effectiv ... 

Keywords: cluster validity, clustering-indexing relationships, cover coel 
coefficient, document retrieval, retrieval effectiveness 
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An-Chow Lai, Babak Falsafi 

July 2000 Proceedings of the twelfth annual ACM symposium on Paral 
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Publisher: ACM Press 

Full text available: ^ pdf( 105. 85 Additional Information: full citation , abst 
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In this paper, we compare and contrast two techniques to improve capaci 
traffic in CC-NUMA DSM clusters. Page migration/replication optimize 
accesses to a page used by a single processor by migrating the page to th 
replicates all read-shared pages in the sharers' local memories. R-NUM^ 
write accesses to any page by allowing a processor to cache that page in 
Page migration/replication requires less hardware c ... 
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